Goto

Collaborating Authors

 normal estimation




SfPUEL: Shape from Polarization under Unknown Environment Light

Neural Information Processing Systems

DeepSfP (4), which is even comparable with the multiview SfP method P ANDORA (15). In addition, metallic and dielectric surfaces exhibit different polarization BRDFs under the same illumination, which causes AoLP maps to vary on different materials, further compounding the normal estimation problem.


NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function -- Supplementary Material -- Qing Li

Neural Information Processing Systems

We provide optimization time ( i. e ., training time in the bracket) and inference time of our method. Our method improves the state-of-the-art results while using much fewer parameters. The surfaces are reconstructed from point clouds with low noise (a) and high noise (b). Fig 2, we show the reconstructed surfaces on point clouds with different noise levels. A partially enlarged view is provided for each shape.


NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function Qing Li

Neural Information Processing Systems

Normal estimation for 3D point clouds is a fundamental task in 3D geometry processing. The state-of-the-art methods rely on priors of fitting local surfaces learned from normal supervision.


HSurf-Net: NormalEstimationfor3DPointCloudsby LearningHyperSurfaces

Neural Information Processing Systems

However, fitting surfaces explicitly from raw point clouds suffers from overfitting or underfitting issues caused by inappropriatepolynomial orders andoutliers, which significantly limits theperformance of existing methods.


HSurf-Net: Normal Estimation for 3D Point Clouds by Learning Hyper Surfaces

Neural Information Processing Systems

We propose a novel normal estimation method called HSurf-Net, which can accurately predict normals from point clouds with noise and density variations. Previous methods focus on learning point weights to fit neighborhoods into a geometric surface approximated by a polynomial function with a predefined order, based on which normals are estimated. However, fitting surfaces explicitly from raw point clouds suffers from overfitting or underfitting issues caused by inappropriate polynomial orders and outliers, which significantly limits the performance of existing methods. To address these issues, we introduce hyper surface fitting to implicitly learn hyper surfaces, which are represented by multi-layer perceptron (MLP) layers that take point features as input and output surface patterns in a high dimensional feature space. We introduce a novel space transformation module, which consists of a sequence of local aggregation layers and global shift layers, to learn an optimal feature space, and a relative position encoding module to effectively convert point clouds into the learned feature space. Our model learns hyper surfaces from the noise-less features and directly predicts normal vectors. We jointly optimize the MLP weights and module parameters in a data-driven manner to make the model adaptively find the most suitable surface pattern for various points. Experimental results show that our HSurf-Net achieves the state-of-the-art performance on the synthetic shape dataset, the real-world indoor and outdoor scene datasets. The code, data and pretrained models are publicly available.


SfPUEL: Shape from Polarization under Unknown Environment Light

Neural Information Processing Systems

DeepSfP (4), which is even comparable with the multiview SfP method P ANDORA (15). In addition, metallic and dielectric surfaces exhibit different polarization BRDFs under the same illumination, which causes AoLP maps to vary on different materials, further compounding the normal estimation problem.


NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function Qing Li

Neural Information Processing Systems

Normal estimation for 3D point clouds is a fundamental task in 3D geometry processing. The state-of-the-art methods rely on priors of fitting local surfaces learned from normal supervision.


From Editor to Dense Geometry Estimator

arXiv.org Artificial Intelligence

Leveraging visual priors from pre-trained text-to-image (T2I) generative models has shown success in dense prediction. However, dense prediction is inherently an image-to-image task, suggesting that image editing models, rather than T2I generative models, may be a more suitable foundation for fine-tuning. Motivated by this, we conduct a systematic analysis of the fine-tuning behaviors of both editors and generators for dense geometry estimation. Our findings show that editing models possess inherent structural priors, which enable them to converge more stably by ``refining" their innate features, and ultimately achieve higher performance than their generative counterparts. Based on these findings, we introduce \textbf{FE2E}, a framework that pioneeringly adapts an advanced editing model based on Diffusion Transformer (DiT) architecture for dense geometry prediction. Specifically, to tailor the editor for this deterministic task, we reformulate the editor's original flow matching loss into the ``consistent velocity" training objective. And we use logarithmic quantization to resolve the precision conflict between the editor's native BFloat16 format and the high precision demand of our tasks. Additionally, we leverage the DiT's global attention for a cost-free joint estimation of depth and normals in a single forward pass, enabling their supervisory signals to mutually enhance each other. Without scaling up the training data, FE2E achieves impressive performance improvements in zero-shot monocular depth and normal estimation across multiple datasets. Notably, it achieves over 35\% performance gains on the ETH3D dataset and outperforms the DepthAnything series, which is trained on 100$\times$ data. The project page can be accessed \href{https://amap-ml.github.io/FE2E/}{here}.